XtremWeb & Condor sharing resources between Internet connected Condor pools

نویسندگان

  • Oleg Lodygensky
  • Gilles Fedak
  • Franck Cappello
  • Vincent Néri
  • Miron Livny
  • Douglas Thain
چکیده

Grid computing presents two major challenges for deploying large scale applications across wide area networks gathering volunteers PC and clusters/parallel computers as computational resources: security and fault tolerance. This paper presents a lightweight Grid solution for the deployment of multi-parameters applications on a set of clusters protected by firewalls. The system uses a hierarchical design based on Condor for managing each cluster locally and XtremWeb for enabling resource sharing among the clusters. We discuss the security and fault tolerance mechanisms used for this design and demonstrate the usefulness of the approach measuring the performances of a multi-parameters biochemistry application deployed on two sites: University of Wisconsin/Madison and Paris South University. This experiment shows that we can efficiently and safely harness the computational power of about 200 PC distributed on two geographic sites.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A worldwide flock of Condors: Load sharing among workstation clusters

Condor is a distributed batch system for sharing the workload of compute-intensive jobs in a pool of Unix workstations connected by a network. In such a Condor pool, idle machines are spotted by Condor and allocated to queued jobs, thus putting otherwise unutilized capacity to e cient use. When institutions owning Condor pools cooperate, they may wish to exploit the joint capacity of their pool...

متن کامل

Leveraging HTC for UK eScience with Very Large Condor pools: Demand for transforming untapped power into results

We provide an insight into the demand from the UK eScience community for very large High Throughput Computing resources and provide an example of such a resource in current production use: the 930-node eMinerals Condor pool at UCL. We demonstrate the significant benefits this resource has provided to UK eScientists via quickly and easily realising results throughout a range of problem areas. We...

متن کامل

Implementation of Decentralized Load Sharing in Networked Workstations Using the Condor Package

In recent years a number of load sharing (LS) mechanisms have been proposed or implemented to fully utilize system resources. We have designed and implemented a decentralized real-time LS mechanism based on the Condor package 17, 18]. Two important features of our design are use of region-change broadcasts in the information policy to provide each workstation with timely state information at mi...

متن کامل

Condor flocking : load sharing between pools of workstations Report 93 - 104 X .

A selection of these reports is available in PostScript form at the Faculty's anonymous ftp-site. They are located in the directory /pub/publications/tech-reports at ftp.twi.tudelft.nl

متن کامل

Making Workstations a Friendly Environment for Batch Jobs

As time-sharing machines are replaced by powerful desktop computers and farms of workstations replace mainframes, more and more users turn to workstations when they need CPU cycles for their batch jobs. Unfortunately, they do not find workstations a very friendly environment for batch processing. Since these types of machines were originally designed as a single user environment, they lack most...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003